
revised.SUB.SEQUENCE.LST.0120 
SUBSTITUTE SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: PULST, STEFAN M. 

(ii) TITLE OF INVENTION: NUCLEIC ACID ENCODING SPINOCEREBELLAR 
ATAXIA-2 AND PRODUCTS RELATED THERETO 

(iii) NUMBER OF SEQUENCES: 19 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: MUETING, RAASCH & GEBHARDT, P.A. 

(B) STREET: 119 NORTH FOURTH STREET, SUITE 203 

(C) CITY: MINNEAPOLIS 

(D) STATE: MINNESOTA 


(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/981,998 

(B) FILING DATE: 11-MAY-1998 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: WO 97/42314 

(B) FILING DATE: 08-MAY-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/727,084 

(B) FILING DATE: 08-OCT-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/022,207 

(B) FILING DATE: 19-JUL-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/017,388 

(B) FILING DATE: 08-MAY-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: MUETING, ANN M. 

(B) REGISTRATION NUMBER: 33,977 

(C) REFERENCE/DOCKET NUMBER: 232.00010120 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 612-305-1217 

(B) TELEFAX: 612-305-1228 



(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

TTGGTAGCAA CGGAAACGGC GGCGGCGCGT TTCGGCCCGG CTCCCGGCGG CTCCTTGGTC 60 

TCGGCGGGCC TCCCCGCCCC TTCGTCGTCG TCCTTCTCCC CCTCGCCAGC CCGGGCGCCC 120 

CTCCGGCCGC GCCAACCCGC GCCTCCCCGC TCGGCGCCCG TGCGTCCCCG CCGCGTTCCG 180 

GCGTCTCCTT GGCGCGCCCG GCTCCCGGCT GTCCCCGCCC GGCGTGCGAG CCGGTGTATG 240 

GGCCCCTCAC CATGTCGCTG AAGCCCCAGC AGCAGCAGCA GCAGCAGCAG CAACAGCAGC 300 

AGCAGCAACA GCAGCAGCAG CAGCAGCAGC AGCCGCCGCC CGCGGCTGCC AATGTCCGCA 360 

AGCCCGGCGG CAGCGGCCTT CTAGCGTCGC CCGCCGCCGC GCCTTCGCCG TCCTCGTCCT 420 

CGGTCTCCTC GTCCTCGGCC ACGGCTCCCT CCTCGGTGGT CGCGGCGACC TCCGGCGGCG 480 

GGAGGCCCGG CCTGGGCAGG TGGGTGTCGG CACCCC 516 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 163..4101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ACCCCCGAGA AAGCAACCCA GCGCGCCGCC CGCTCCTCAC GTGTCCCTCC CGGCCCCGGG 60 

GCCACCTCAC GTTCTGCTTC CGTCTGACCC CTCCGACTTC CGGTAAAGAG TCCCTATCCG 120 

CACCTCCGCT CCCACCCGGC GCCTCGGCGC GCCCGCCCTC CG ATG CGC TCA GCG 174 
Met Arg Ser Ala 
1 

GCC GCA GCT CCT CGG AGT CCC GCG GTG GCC ACC GAG TCT CGC CGC TTC 222 
Ala Ala Ala Pro Arg Ser Pro Ala Val Ala Thr Glu Ser Arg Arg Phe 
5 10 15 20 

GCC GCA GCC AGG TGG CCC GGG TGG CGC TCG CTC CAG CGG CCG GCG CGG 270 
Ala Ala Ala Arg Trp Pro Gly Trp Arg Ser Leu Gln Arg Pro Ala Arg 
25 30 35 

CGG AGC GGG CGG GGC GGC GGT GGC GCG GCC CCG GGA CCG TAT CCC TCC 318 
Arg Ser Gly Arg Gly Gly Gly Gly Ala Ala Pro Gly Pro Tyr Pro Ser 
40 45 50 

GCC GCC CCT CCC CCG CCC GGC CCC GGC CCC CCT CCC TCC CGG CAG AGC 366 
Ala Ala Pro Pro Pro Pro Gly Pro Gly Pro Pro Pro Ser Arg Gln Ser 
55 60 65 

TCG CCT CCC TCC GCC TCA GAC TGT TTT GGT AGC AAC GGC AAC GGC GGC 414 
Ser Pro Pro Ser Ala Ser Asp Cys Phe Gly Ser Asn Gly Asn Gly Gly 
70 75 80 

GGC GCG TTT CGG CCC GGC TCC CGG CGG CTC CTT GGT CTC GGC GGG CCT 462 
Gly Ala Phe Arg Pro Gly Ser Arg Arg Leu Leu Gly Leu Gly Gly Pro 
85 90 95 100 

CCC CGC CCC TTC GTC GTC GTC CTT CTC CCC CTC GCC AGC CCG GGC GCC 510 
Pro Arg Pro Phe Val Val Val Leu Leu Pro Leu Ala Ser Pro Gly Ala 
105 110 115 

CCT CCG GCC GCG CCA ACC CGC GCC TCC CCG CTC GGC GCC CGT GCG TCC 558 
Pro Pro Ala Ala Pro Thr Arg Ala Ser Pro Leu Gly Ala Arg Ala Ser 
120 125 130 

CCG CCG CGT TCC GGC GTC TCC TTG GCG CGC CCG GCT CCC GGC TGT CCC 606 
Pro Pro Arg Ser Gly Val Ser Leu Ala Arg Pro Ala Pro Gly Cys Pro 
135 140 145 

CGC CCG GCG TGC GAG CCG GTG TAT GGG CCC CTC ACC ATG TCG CTG AAG 654 
Arg Pro Ala Cys Glu Pro Val Tyr Gly Pro Leu Thr Met Ser Leu Lys 
150 155 160 

CCC CAG CAG CAG CAG CAG CAG CAG CAG CAA CAG CAG CAG CAG CAA CAG 702 
Pro Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
165 170 175 180 

CAG CAG CAG CAG CAG CAG CAG CCG CCG CCC GCG GCT GCC AAT GTC CGC 750 
Gln Gln Gln Gln Gln Gln Gln Pro Pro Pro Ala Ala Ala Asn Val Arg 
185 190 195 

AAG CCC GGC GGC AGC GGC CTT CTA GCG TCG CCC GCC GCC GCG CCT TCG 798 
Lys Pro Gly Gly Ser Gly Leu Leu Ala Ser Pro Ala Ala Ala Pro Ser 
200 205 210 

CCG TCC TCG TCC TCG GTC TCC TCG TCC TCG GCC ACG GCT CCC TCC TCG 846 
Pro Ser Ser Ser Ser Val Ser Ser Ser Ser Ala Thr Ala Pro Ser Ser 
215 220 225 

GTG GTC GCG GCG ACC TCC GGC GGC GGG AGG CCC GGC CTG GGC AGA GGT 894 
Val Val Ala Ala Thr Ser Gly Gly Gly Arg Pro Gly Leu Gly Arg Gly 
230 235 240 

CGA AAC AGT AAC AAA GGA CTG CCT CAG TCT ACG ATT TCT TTT GAT GGA 942 
Arg Asn Ser Asn Lys Gly Leu Pro Gln Ser Thr Ile Ser Phe Asp Gly 
245 250 255 260 

ATC TAT GCA AAT ATG AGG ATG GTT CAT ATA CTT ACA TCA GTT GTT GGC 990 
Ile Tyr Ala Asn Met Arg Met Val His Ile Leu Thr Ser Val Val Gly 
265 270 275 

TCC AAA TGT GAA GTA CAA GTG AAA AAT GGA GGT ATA TAT GAA GGA GTT 1038 
Ser Lys Cys Glu Val Gln Val Lys Asn Gly Gly Ile Tyr Glu Gly Val 
280 285 290 

TTT AAA ACT TAC AGT CCG AAG TGT GAT TTG GTA CTT GAT GCC GCA CAT 1086 
Phe Lys Thr Tyr Ser Pro Lys Cys Asp Leu Val Leu Asp Ala Ala His 
295 300 305 

GAG AAA AGT ACA GAA TCC AGT TCG GGG CCG AAA CGT GAA GAA ATA ATG 1134 
Glu Lys Ser Thr Glu Ser Ser Ser Gly Pro Lys Arg Glu Glu Ile Met 
310 315 320 

GAG AGT ATT TTG TTC AAA TGT TCA GAC TTT GTT GTG GTA CAG TTT AAA 1182 
Glu Ser Ile Leu Phe Lys Cys Ser Asp Phe Val Val Val Gln Phe Lys 
325 330 335 340 

GAT ATG GAC TCC AGT TAT GCA AAA AGA GAT GCT TTT ACT GAC TCT GCT 1230 
Asp Met Asp Ser Ser Tyr Ala Lys Arg Asp Ala Phe Thr Asp Ser Ala 
345 350 355 

ATC AGT GCT AAA GTG AAT GGC GAA CAC AAA GAG AAG GAC CTG GAG CCC 1278 
Ile Ser Ala Lys Val Asn Gly Glu His Lys Glu Lys Asp Leu Glu Pro 
360 365 370 

TGG GAT GCA GGT GAA CTC ACA GCC AAT GAG GAA CTT GAG GCT TTG GAA 1326 
Trp Asp Ala Gly Glu Leu Thr Ala Asn Glu Glu Leu Glu Ala Leu Glu 
375 380 385 

AAT GAC GTA TCT AAT GGA TGG GAT CCC AAT GAT ATG TTT CGA TAT AAT 1374 
Asn Asp Val Ser Asn Gly Trp Asp Pro Asn Asp Met Phe Arg Tyr Asn 
390 395 400 

GAA GAA AAT TAT GGT GTA GTG TCT ACG TAT GAT AGC AGT TTA TCT TCG 1422 
Glu Glu Asn Tyr Gly Val Val Ser Thr Tyr Asp Ser Ser Leu Ser Ser 
405 410 415 420 

TAT ACA GTG CCC TTA GAA AGA GAT AAC TCA GAA GAA TTT TTA AAA CGG 1470 
Tyr Thr Val Pro Leu Glu Arg Asp Asn Ser Glu Glu Phe Leu Lys Arg 
425 430 435 

GAA GCA AGG GCA AAC CAG TTA GCA GAA GAA ATT GAG TCA AGT GCC CAG 1518 
Glu Ala Arg Ala Asn Gln Leu Ala Glu Glu Ile Glu Ser Ser Ala Gln 
440 445 450 

TAC AAA GCT CGA GTG GCC CTG GAA AAT GAT GAT AGG AGT GAG GAA GAA 1566 
Tyr Lys Ala Arg Val Ala Leu Glu Asn Asp Asp Arg Ser Glu Glu Glu 
455 460 465 

AAA TAC ACA GCA GTT CAG AGA AAT TCC AGT GAA CGT GAG GGG CAC AGC 1614 
Lys Tyr Thr Ala Val Gln Arg Asn Ser Ser Glu Arg Glu Gly His Ser 
470 475 480 

ATA AAC ACT AGG GAA AAT AAA TAT ATT CCT CCT GGA CAA AGA AAT AGA 1662 
Ile Asn Thr Arg Glu Asn Lys Tyr Ile Pro Pro Gly Gln Arg Asn Arg 
485 490 495 500 

GAA GTC ATA TCC TGG GGA AGT GGG AGA CAG AAT TCA CCG CGT ATG GGC 1710 
Glu Val Ile Ser Trp Gly Ser Gly Arg Gln Asn Ser Pro Arg Met Gly 
505 510 515 



CAG CCT GGA TCG GGC TCC ATG CCA TCA AGA TCC ACT TCT CAC ACT TCA 1758 
Gln Pro Gly Ser Gly Ser Met Pro Ser Arg Ser Thr Ser His Thr Ser 
520 525 530 

GAT TTC AAC CCG AAT TCT GGT TCA GAC CAA AGA GTA GTT AAT GGA GGT 1806 
Asp Phe Asn Pro Asn Ser Gly Ser Asp Gln Arg Val Val Asn Gly Gly 
535 540 545 

GTT CCC TGG CCA TCG CCT TGC CCA TCT CCT TCC TCT CGC CCA CCT TCT 1854 
Val Pro Trp Pro Ser Pro Cys Pro Ser Pro Ser Ser Arg Pro Pro Ser 
550 555 560 

CGC TAC CAG TCA GGT CCC AAC TCT CTT CCA CCT CGG GCA GCC ACC CCT 1902 
Arg Tyr Gln Ser Gly Pro Asn Ser Leu Pro Pro Arg Ala Ala Thr Pro 
565 570 575 580 

ACA CGG CCG CCC TCC AGG CCC CCC TCG CGG CCA TCC AGA CCC CCG TCT 1950 
Thr Arg Pro Pro Ser Arg Pro Pro Ser Arg Pro Ser Arg Pro Pro Ser 
585 590 595 

CAC CCC TCT GCT CAT GGT TCT CCA GCT CCT GTC TCT ACT ATG CCT AAA 1998 
His Pro Ser Ala His Gly Ser Pro Ala Pro Val Ser Thr Met Pro Lys 
600 605 610 

CGC ATG TCT TCA GAA GGG CCT CCA AGG ATG TCC CCA AAG GCC CAG CGA 2046 
Arg Met Ser Ser Glu Gly Pro Pro Arg Met Ser Pro Lys Ala Gln Arg 
615 620 625 

CAT CCT CGA AAT CAC AGA GTT TCT GCT GGG AGG GGT TCC ATA TCC AGT 2094 
His Pro Arg Asn His Arg Val Ser Ala Gly Arg Gly Ser Ile Ser Ser 
630 635 640 

GGC CTA GAA TTT GTA TCC CAC AAC CCA CCC AGT GAA GCA GCT ACT CCT 2142 
Gly Leu Glu Phe Val Ser His Asn Pro Pro Ser Glu Ala Ala Thr Pro 
645 650 655 660 

CCA GTA GCA AGG ACC AGT CCC TCG GGG GGA ACG TGG TCA TCA GTG GTC 2190 
Pro Val Ala Arg Thr Ser Pro Ser Gly Gly Thr Trp Ser Ser Val Val 
665 670 675 

AGT GGG GTT CCA AGA TTA TCC CCT AAA ACT CAT AGA CCC AGG TCT CCC 2238 
Ser Gly Val Pro Arg Leu Ser Pro Lys Thr His Arg Pro Arg Ser Pro 
680 685 690 

AGA CAG AAC AGT ATT GGA AAT ACC CCC AGT GGG CCA GTT CTT GCT TCT 2286 
Arg Gln Asn Ser Ile Gly Asn Thr Pro Ser Gly Pro Val Leu Ala Ser 
695 700 705 

CCC CAA GCT GGT ATT ATT CCA ACT GAA GCT GTT GCC ATG CCT ATT CCA 2334 
Pro Gln Ala Gly Ile Ile Pro Thr Glu Ala Val Ala Met Pro Ile Pro 
710 715 720 

GCT GCA TCT CCT ACG CCT GCT AGT CCT GCA TCG AAC AGA GCT GTT ACC 2382 
Ala Ala Ser Pro Thr Pro Ala Ser Pro Ala Ser Asn Arg Ala Val Thr 
725 730 735 740 

CCT TCT AGT GAG GCT AAA GAT TCC AGG CTT CAA GAT CAG AGG CAG AAC 2430 
Pro Ser Ser Glu Ala Lys Asp Ser Arg Leu Gln Asp Gln Arg Gln Asn 
745 750 755 

TCT CCT GCA GGG AAT AAA GAA AAT ATT AAA CCC AAT GAA ACA TCA CCT 2478 
Ser Pro Ala Gly Asn Lys Glu Asn Ile Lys Pro Asn Glu Thr Ser Pro 
760 765 770 
Ser Ala Met Met His Pro Ala Ser Ala Ala Gly Pro Pro Ile Ala Ala 
1015 1020 1025 

ACC CCA CCA GCT TAC TCC ACG CAA TAT GTT GCC TAC AGT CCT CAG CAG 3294 
Thr Pro Pro Ala Tyr Ser Thr Gln Tyr Val Ala Tyr Ser Pro Gln Gln 
1030 1035 1040 

TTC CCA AAT CAG CCC CTT GTT CAG CAT GTG CCA CAT TAT CAG TCT CAG 3342 
Phe Pro Asn Gln Pro Leu Val Gln His Val Pro His Tyr Gln Ser Gln 
1045 1050 1055 1060 

CAT CCT CAT GTC TAT AGT CCT GTA ATA CAG GGT AAT GCT AGA ATG ATG 3390 
His Pro His Val Tyr Ser Pro Val Ile Gln Gly Asn Ala Arg Met Met 
1065 1070 1075 

GCA CCA CCA ACA CAC GCC CAG CCT GGT TTA GTA TCT TCT TCA GCA ACT 3438 
Ala Pro Pro Thr His Ala Gln Pro Gly Leu Val Ser Ser Ser Ala Thr 
1080 1085 1090 

CAG TAC GGG GCT CAT GAG CAG ACG CAT GCG ATG TAT GCA TGT CCC AAA 3486 
Gln Tyr Gly Ala His Glu Gln Thr His Ala Met Tyr Ala Cys Pro Lys 
1095 1100 1105 

TTA CCA TAC AAC AAG GAG ACA AGC CCT TCT TTC TAC TTT GCC ATT TCC 3534 
Leu Pro Tyr Asn Lys Glu Thr Ser Pro Ser Phe Tyr Phe Ala Ile Ser 
1110 1115 1120 

ACG GGC TCC CTT GCT CAG CAG TAT GCG CAC CCT AAC GCT ACC CTG CAC 3582 
Thr Gly Ser Leu Ala Gln Gln Tyr Ala His Pro Asn Ala Thr Leu His 
1125 1130 1135 1140 

CCA CAT ACT CCA CAC CCT CAG CCT TCA GCT ACC CCC ACT GGA CAG CAG 3630 
Pro His Thr Pro His Pro Gln Pro Ser Ala Thr Pro Thr Gly Gln Gln 
1145 1150 1155 

CAA AGC CAA CAT GGT GGA AGT CAT CCT GCA CCC AGT CCT GTT CAG CAC 3678 
Gln Ser Gln His Gly Gly Ser His Pro Ala Pro Ser Pro Val Gln His 
1160 1165 1170 

CAT CAG CAC CAG GCC GCC CAG GCT CTC CAT CTG GCC AGT CCA CAG CAG 3726 
His Gln His Gln Ala Ala Gln Ala Leu His Leu Ala Ser Pro Gln Gln 
1175 1180 1185 

CAG TCA GCC ATT TAC CAC GCG GGG CTT GCG CCA ACT CCA CCC TCC ATG 3774 
Gln Ser Ala Ile Tyr His Ala Gly Leu Ala Pro Thr Pro Pro Ser Met 
1190 1195 1200 

ACA CCT GCC TCC AAC ACG CAG TCG CCA CAG AAT AGT TTC CCA GCA GCA 3822 
Thr Pro Ala Ser Asn Thr Gln Ser Pro Gln Asn Ser Phe Pro Ala Ala 
1205 1210 1215 1220 

CAA CAG ACT GTC TTT ACG ATC CAT CCT TCT CAC GTT CAG CCG GCG TAT 3870 
Gln Gln Thr Val Phe Thr Ile His Pro Ser His Val Gln Pro Ala Tyr 
1225 1230 1235 

ACC AAC CCA CCC CAC ATG GCC CAC GTA CCT CAG GCT CAT GTA CAG TCA 3918 
Thr Asn Pro Pro His Met Ala His Val Pro Gln Ala His Val Gln Ser 
1240 1245 1250 

rr\ ATG GTT CCT TCT CAT CCA ACT GCC CAT GCG CCA ATG ATG CTA ATG 3966 
Gly Met Val Pro Ser His Pro Thr Ala His Ala Pro Met Met Leu Met 
1255 1260 1265 

Thr Thr Gln Pro Pro Gly Gly Pro Gln Ala Ala Leu Ala Gln Ser Ala 
1270 1275 1280 

Leu Gln Pro Ile Pro Val Ser Thr Thr Ala His Phe Pro Tyr Met Thr 
1285 1290 1295 1300 

CAC CCT TCA GTA CAA GCC CAC CAC CAA CAG CAG TTG TAA GGCTGCCCTG 4111 
His Pro Ser Val Gln Ala His His Gln Gln Gln Leu 
1305 1310 

GAGGAACCGA AAGGCCAAAT TCCCTCCTCC CTTCTACTGC TTCTACCAAC TGGAAGCACA 4171 
GAAAACTAGA ATTTCATTTA TTTTGTTTTT AAAATATATA TGTTGATTTC TTGTAACATC 4231 
CAATAGGAAT GCTAACAGTT CACTTGCAGT GGAAGATACT TGGACCGAGT AGAGGCATTT 4291 
AGGAACTTGG GGGCTATTCC ATAATTCCAT ATGCTGTTTC AGAGTCCCGC AGGTACCCCA 4351 
GCTCTGCTTG CCGAAACTGG AAGTTATTTA TTTTTTAATA ACCCTTGAAA GTCATGAACA 4411 
CATCAGCTAG CAAAAGAAGT AACAAGAGTG ATTCTTGCTG CTATTACTGC TAAAAAAAAA 4471 
AAAAAAAAAA 4481 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1312 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Arg Ser Ala Ala Ala Ala Pro Arg Ser Pro Ala Val Ala Thr Glu 
1 5 10 15 

Ser Arg Arg Phe Ala Ala Ala Arg Trp Pro Gly Trp Arg Ser Leu Gln 
20 25 30 

Arg Pro Ala Arg Arg Ser Gly Arg Gly Gly Gly Gly Ala Ala Pro Gly 
35 40 45 

Pro Tyr Pro Ser Ala Ala Pro Pro Pro Pro Gly Pro Gly Pro Pro Pro 
50 55 60 

Ser Arg Gln Ser Ser Pro Pro Ser Ala Ser Asp Cys Phe Gly Ser Asn 
65 70 75 80 

Gly Asn Gly Gly Gly Ala Phe Arg Pro Gly Ser Arg Arg Leu Leu Gly 
85 90 95 

Leu Gly Gly Pro Pro Arg Pro Phe Val Val Val Leu Leu Pro Leu Ala 
100 105 110 

Ser Pro Gly Ala Pro Pro Ala Ala Pro Thr Arg Ala Ser Pro Leu Gly 
115 120 125 

Ala Arg Ala Ser Pro Pro Arg Ser Gly Val Ser Leu Ala Arg Pro Ala 
130 135 140 

Pro Gly Cys Pro Arg Pro Ala Cys Glu Pro Val Tyr Gly Pro Leu Thr 
145 150 155 160 

Met Ser Leu Lys Pro Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
165 170 175 

Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Pro Pro Pro Ala Ala 
180 185 190 

Ala Asn Val Arg Lys Pro Gly Gly Ser Gly Leu Leu Ala Ser Pro Ala 
195 200 205 

Ala Ala Pro Ser Pro Ser Ser Ser Ser Val Ser Ser Ser Ser Ala Thr 
210 215 220 

Ala Pro Ser Ser Val Val Ala Ala Thr Ser Gly Gly Gly Arg Pro Gly 
225 230 235 240 

Leu Gly Arg Gly Arg Asn Ser Asn Lys Gly Leu Pro Gln Ser Thr Ile 
245 250 255 

Ser Phe Asp Gly Ile Tyr Ala Asn Met Arg Met Val His Ile Leu Thr 
260 265 270 

Ser Val Val Gly Ser Lys Cys Glu Val Gln Val Lys Asn Gly Gly Ile 
275 280 285 

Tyr Glu Gly Val Phe Lys Thr Tyr Ser Pro Lys Cys Asp Leu Val Leu 
290 295 300 

Asp Ala Ala His Glu Lys Ser Thr Glu Ser Ser Ser Gly Pro Lys Arg 
305 310 315 320 

Glu Glu Ile Met Glu Ser Ile Leu Phe Lys Cys Ser Asp Phe Val Val 
325 330 335 

Val Gln Phe Lys Asp Met Asp Ser Ser Tyr Ala Lys Arg Asp Ala Phe 
340 345 350 

Thr Asp Ser Ala Ile Ser Ala Lys Val Asn Gly Glu His Lys Glu Lys 
355 360 365 

Asp Leu Glu Pro Trp Asp Ala Gly Glu Leu Thr Ala Asn Glu Glu Leu 
370 375 380 

Glu Ala Leu Glu Asn Asp Val Ser Asn Gly Trp Asp Pro Asn Asp Met 
385 390 395 400 

Phe Arg Tyr Asn Glu Glu Asn Tyr Gly Val Val Ser Thr Tyr Asp Ser 
405 410 415 

Ser Leu Ser Ser Tyr Thr Val Pro Leu Glu Arg Asp Asn Ser Glu Glu 
420 425 430 

Phe Leu Lys Arg Glu Ala Arg Ala Asn Gln Leu Ala Glu Glu Ile Glu 
435 440 445 

Ser Ser Ala Gln Tyr Lys Ala Arg Val Ala Leu Glu Asn Asp Asp Arg 
450 455 460 

Ser Glu Glu Glu Lys Tyr Thr Ala Val Gln Arg Asn Ser Ser Glu Arg 
465 470 475 480 

Glu Gly His Ser Ile Asn Thr Arg Glu Asn Lys Tyr Ile Pro Pro Gly 
485 490 495 

Gln Arg Asn Arg Glu Val Ile Ser Trp Gly Ser Gly Arg Gln Asn Ser 
500 505 510 

Pro Arg Met Gly Gln Pro Gly Ser Gly Ser Met Pro Ser Arg Ser Thr 
515 520 525 

Ser His Thr Ser Asp Phe Asn Pro Asn Ser Gly Ser Asp Gln Arg Val 
530 535 540 

Val Asn Gly Gly Val Pro Trp Pro Ser Pro Cys Pro Ser Pro Ser Ser 
545 550 555 560 

Arg Pro Pro Ser Arg Tyr Gln Ser Gly Pro Asn Ser Leu Pro Pro Arg 
565 570 575 

Ala Ala Thr Pro Thr Arg Pro Pro Ser Arg Pro Pro Ser Arg Pro Ser 
580 585 590 

Arg Pro Pro Ser His Pro Ser Ala His Gly Ser Pro Ala Pro Val Ser 
595 600 605 

Thr Met Pro Lys Arg Met Ser Ser Glu Gly Pro Pro Arg Met Ser Pro 
610 615 620 

Lys Ala Gln Arg His Pro Arg Asn His Arg Val Ser Ala Gly Arg Gly 
625 630 635 640 

Ser Ile Ser Ser Gly Leu Glu Phe Val Ser His Asn Pro Pro Ser Glu 
645 650 655 

Ala Ala Thr Pro Pro Val Ala Arg Thr Ser Pro Ser Gly Gly Thr Trp 
660 665 670 

Ser Ser Val Val Ser Gly Val Pro Arg Leu Ser Pro Lys Thr His Arg 
675 680 685 

Pro Arg Ser Pro Arg Gln Asn Ser Ile Gly Asn Thr Pro Ser Gly Pro 
690 695 700 

Val Leu Ala Ser Pro Gln Ala Gly Ile Ile Pro Thr Glu Ala Val Ala 
705 710 715 720 

Met Pro Ile Pro Ala Ala Ser Pro Thr Pro Ala Ser Pro Ala Ser Asn 
725 730 735 

Arg Ala Val Thr Pro Ser Ser Glu Ala Lys Asp Ser Arg Leu Gln Asp 
740 745 750 

Gln Arg Gln Asn Ser Pro Ala Gly Asn Lys Glu Asn Ile Lys Pro Asn 
755 760 765 

Glu Thr Ser Pro Ser Phe Ser Lys Ala Glu Asn Lys Gly Ile Ser Pro 
770 775 780 

Val Val Ser Glu His Arg Lys Gln Ile Asp Asp Leu Lys Lys Phe Lys 
785 790 795 800 

Asn Asp Phe Arg Leu Gln Pro Ser Ser Thr Ser Glu Ser Met Asp Gln 
805 810 815 

Leu Leu Asn Lys Asn Arg Glu Gly Glu Lys Ser Arg Asp Leu Ile Lys 
820 825 830 

Asp Lys Ile Glu Pro Ser Ala Lys Asp Ser Phe Ile Glu Asn Ser Ser 
835 840 845 

Ser Asn Cys Thr Ser Gly Ser Ser Lys Pro Asn Ser Pro Ser Ile Ser 
850 855 860 

Pro Ser Ile Leu Ser Asn Thr Glu His Lys Arg Gly Pro Glu Val Thr 
865 870 875 880 

Ser Gln Gly Val Gln Thr Ser Ser Pro Ala Cys Lys Gln Glu Lys Asp 
885 890 895 

Asp Lys Glu Glu Lys Lys Asp Ala Ala Glu Gln Val Arg Lys Ser Thr 
900 905 910 

Leu Asn Pro Asn Ala Lys Glu Phe Asn Pro Arg Ser Phe Ser Gln Pro 
915 920 925 

Lys Pro Ser Thr Thr Pro Thr Ser Pro Arg Pro Gln Ala Gln Pro Ser 
930 935 940 

Pro Ser Met Val Gly His Gln Gln Pro Thr Pro Val Tyr Thr Gln Pro 
945 950 955 960 

Val Cys Phe Ala Pro Asn Met Met Tyr Pro Val Pro Val Ser Pro Gly 
965 970 975 

Val Gln Pro Leu Tyr Pro Ile Pro Met Thr Pro Met Pro Val Asn Gln 
980 985 990 

Ala Lys Thr Tyr Arg Ala Val Pro Asn Met Pro Gln Gln Arg Gln Asp 
995 1000 1005 

Gln His His Gln Ser Ala Met Met His Pro Ala Ser Ala Ala Gly Pro 
1010 1015 1020 

Pro Ile Ala Ala Thr Pro Pro Ala Tyr Ser Thr Gln Tyr Val Ala Tyr 
1025 1030 1035 1040 

Ser Pro Gln Gln Phe Pro Asn Gln Pro Leu Val Gln His Val Pro His 
1045 1050 1055 

Tyr Gln Ser Gln His Pro His Val Tyr Ser Pro Val Ile Gln Gly Asn 
1060 1065 1070 

Ala Arg Met Met Ala Pro Pro Thr His Ala Gln Pro Gly Leu Val Ser 
1075 1080 1085 

Ser Ser Ala Thr Gln Tyr Gly Ala His Glu Gln Thr His Ala Met Tyr 
1090 1095 1100 

Ala Cys Pro Lys Leu Pro Tyr Asn Lys Glu Thr Ser Pro Ser Phe Tyr 
1105 1110 1115 1120 
Phe Ala Ile Ser Thr Gly Ser Leu Ala Gln Gln Tyr Ala His Pro Asn 
1125 1130 1135 

Ala Thr Leu His Pro His Thr Pro His Pro Gln Pro Ser Ala Thr Pro 
1140 1145 1150 

Thr Gly Gln Gln Gln Ser Gln His Gly Gly Ser His Pro Ala Pro Ser 
1155 1160 1165 

Pro Val Gln His His Gln His Gln Ala Ala Gln Ala Leu His Leu Ala 
1170 1175 1180 

Ser Pro Gln Gln Gln Ser Ala Ile Tyr His Ala Gly Leu Ala Pro Thr 
1185 1190 1195 1200 

Pro Pro Ser Met Thr Pro Ala Ser Asn Thr Gln Ser Pro Gln Asn Ser 
1205 1210 1215 

Phe Pro Ala Ala Gln Gln Thr Val Phe Thr Ile His Pro Ser His Val 
1220 1225 1230 

Gln Pro Ala Tyr Thr Asn Pro Pro His Met Ala His Val Pro Gln Ala 
1235 1240 1245 

His Val Gln Ser Gly Met Val Pro Ser His Pro Thr Ala His Ala Pro 
1250 1255 1260 

Met Met Leu Met Thr Thr Gln Pro Pro Gly Gly Pro Gln Ala Ala Leu 
1265 1270 1275 1280 

Ala Gln Ser Ala Leu Gln Pro Ile Pro Val Ser Thr Thr Ala His Phe 
1285 1290 1295 

Pro Tyr Met Thr His Pro Ser Val Gln Ala His His Gln Gln Gln Leu 
1300 1305 1310 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3798 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 50..3457 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGCACGAGGT CCCCGCCCGG CGTGCGAGCC GGTGTATGGG CCGCTCACC ATG TCG 55 
Met Ser 
1 

CTG AAG CCG CAG CCG CAG CCG CCC GCG CCC GCC ACT GGC CGC AAG CCC 103 
Leu Lys Pro Gln Pro Gln Pro Pro Ala Pro Ala Thr Gly Arg Lys Pro 
5 10 15 



rrr CCC GGC CTG CTC TCG TCG CCC GGC GCC GCG CCG GCC TCG GCC GCG 151 
Gly Gly Gly Leu Leu Ser Ser Pro Gly Ala Ala Pro Ala Ser Ala Ala 
20 25 30 

GTC ACC TCG GCT TCC GTG GTG CCG GCC CCG GCC GCG CCG GTG GCG TCT 199 
Val Thr Ser Ala Ser Val Val Pro Ala Pro Ala Ala Pro Val Ala Ser 
35 40 45 50 

TCC TCG GCG GCC GCG GGC GGC GGG CGT CCC GGC CTG GGC AGA GGT CGG 247 
Ser Ser Ala Ala Ala Gly Gly Gly Arg Pro Gly Leu Gly Arg Gly Arg 
55 60 65 

AAC AGT AGC AAA GGA CTG CCT CAG CCT ACG ATT TCT TTT GAT GGA ATC 295 
Asn Ser Ser Lys Gly Leu Pro Gln Pro Thr Ile Ser Phe Asp Gly Ile 
70 75 80 

TAT GCA AAC GTG AGG ATG GTT CAT ATA CTT ACG TCA GTT GTT GGA TCG 343 
Tyr Ala Asn Val Arg Met Val His Ile Leu Thr Ser Val Val Gly Ser 
85 90 95 

AAA TGT GAA GTA CAA GTG AAA AAC GGA GGC ATA TAT GAA GGA GTT TTT 391 
Lys Cys Glu Val Gln Val Lys Asn Gly Gly Ile Tyr Glu Gly Val Phe 
100 105 110 

AAA ACA TAC AGT CCT AAG TGT GAC TTG GTA CTT GAT GCT GCA CAT GAG 439 
Lys Thr Tyr Ser Pro Lys Cys Asp Leu Val Leu Asp Ala Ala His Glu 
115 120 125 130 

AAA AGT ACA GAA TCC AGT TCG GGG CCA AAA CGT GAA GAA ATA ATG GAG 487 
Lys Ser Thr Glu Ser Ser Ser Gly Pro Lys Arg Glu Glu Ile Met Glu 
135 140 145 

r -r-pc AAA TGC TCA GAC TTC GTT GTG GTA CAG TTT AAA GAT 535 
Ser Val Leu Phe Lys Cys Ser Asp Phe Val Val Val Gln Phe Lys Asp 
150 155 160 

ACA GAC TCC AGT TAT GCA CGG AGA GAT GCT TTT ACT GAC TCT GCT CTC 583 
Thr Asp Ser Ser Tyr Ala Arg Arg Asp Ala Phe Thr Asp Ser Ala Leu 
165 170 175 

AGC GCA AAG GTG AAT GGT GAG CAC AAG GAG AAG GAC CTG GAG CCC TGG 631 
Ser Ala Lys Val Asn Gly Glu His Lys Glu Lys Asp Leu Glu Pro Trp 
180 185 190 

GAT GCA GGG GAG CTC ACG GCC AGC GAG GAG CTG GAG CTG GAG AAT GAT 679 
Asp Ala Gly Glu Leu Thr Ala Ser Glu Glu Leu Glu Leu Glu Asn Asp 
195 200 205 210 

GTC TCT AAT GGA TGG GAC CCC AAT GAC ATG TTT CGA TAT AAT GAA GAG 727 
Val Ser Asn Gly Trp Asp Pro Asn Asp Met Phe Arg Tyr Asn Glu Glu 
215 220 225 

AAT TAT GGT GTG GTG TCC ACA TAT GAT AGC AGT TTA TCT TCA TAT ACG 775 
Asn Tyr Gly Val Val Ser Thr Tyr Asp Ser Ser Leu Ser Ser Tyr Thr 
230 235 240 

GTT CCT TTA GAA AGG GAC AAC TCA GAA GAA TTT CTT AAA CGG GAG GCA 823 
Val Pro Leu Glu Arg Asp Asn Ser Glu Glu Phe Leu Lys Arg Glu Ala 
245 250 255 

Arg Ala Asn Gln Leu Ala Glu Glu Ile Glu Ser Ser Ala Gln Tyr Lys 
260 265 270 

rrx CGT GTC GCC CTT GAG AAT GAT GAC CGG AGT GAG GAA GAA AAA TAC 919 
Ala Arg Val Ala Leu Glu Asn Asp Asp Arg Ser Glu Glu Glu Lys Tyr 
275 280 285 290 

sr . GCA rxr CAG AGA AAC TGC AGT GAC CGG GAG GGG CAT GGC CCC AAC 967 
Thr Ala Val Gln Arg Asn Cys Ser Asp Arg Glu Gly His Gly Pro Asn 
295 300 305 

ACT arr rar AAT AAA TAT ATT CCT CCT GGA CAA AGA AAC AGA GAA GTC 1015 
Thr Arg Asp Asn Lys Tyr Ile Pro Pro Gly Gln Arg Asn Arg Glu Val 
310 315 320 

CTA TCC TGG GGA AGT GGG AGA CAG AGC TCA CCA CGG ATG GGC CAG CCT 1063 
Leu Ser Trp Gly Ser Gly Arg Gln Ser Ser Pro Arg Met Gly Gln Pro 
325 330 335 

GGG CCA GGC TCC ATG CCG TCA AGA GCT GCT TCT CAC ACT TCA GAT TTC 1111 
Gly Pro Gly Ser Met Pro Ser Arg Ala Ala Ser His Thr Ser Asp Phe 
340 345 350 

AAC CCG AAC GCT GGC TCA GAC CAA AGA GTA GTT AAT GGA GGT GTT CCC 1159 
Asn Pro Asn Ala Gly Ser Asp Gln Arg Val Val Asn Gly Gly Val Pro 
355 360 365 370 

TGG CCA Trr CCT TGC CCA TCT CAT TCC TCT CGC CCA CCT TCT CGC TAC 1207 
Trp Pro Ser Pro Cys Pro Ser His Ser Ser Arg Pro Pro Ser Arg Tyr 
375 380 385 

r Ar TCA GGT CCC AAC TCT CTT CCA CCT CGG GCA GCC ACC CAT ACA CGG 1255 
Gln Ser Gly Pro Asn Ser Leu Pro Pro Arg Ala Ala Thr His Thr Arg 
390 395 400 

rrr err TCC AGG CCC CCC TCG AGG CCA TCC AGA CCC CCG TCT CAC CCC 1303 
Pro Pro Ser Arg Pro Pro Ser Arg Pro Ser Arg Pro Pro Ser His Pro 
405 410 415 

___ GCT CAT GGT TCT CCA GCT CCT GTC TCT ACT ATG CCT AAA CGC ATG 1351 
Ser Ala His Gly Ser Pro Ala Pro Val Ser Thr Met Pro Lys Arg Met 
420 425 430 

TCT TCA GAA GGA CCC CCA AGG ATG TCT CCA AAG GCA CAG CGC CAC CCT 1399 
Ser Ser Glu Gly Pro Pro Arg Met Ser Pro Lys Ala Gln Arg His Pro 
435 440 445 450 

rrr AAT CAC AGA GTC TCT GCT GGG AGA GGC TCC ATG TCT AGT GGC CTA 1447 
Arg Asn His Arg Val Ser Ala Gly Arg Gly Ser Met Ser Ser Gly Leu 
455 460 465 

GAA TTT GTA TCC CAC AAT CCC CCA AGT GAA GCA GCT GCT CCT CCA GTG 1495 
Glu Phe Val Ser His Asn Pro Pro Ser Glu Ala Ala Ala Pro Pro Val 
470 475 480 

GCA AGG ACC AGT CCT GCA GGG GGA ACG TGG TCC TCA GTG GTC AGT GGG 1543 
Ala Arg Thr Ser Pro Ala Gly Gly Thr Trp Ser Ser Val Val Ser Gly 
485 490 495 




ST SS SS S IS SS SS SS SS « SS SJ IS SS S5 SS 

500 505 

SS £ SS » « IS SS SS 85 SS 1 S SS - SSS §}S 

515 520 

si as ss ss ss a as ss sit is ss ss sit ss ss as 

E £ SS SS SS SS SS SS I SS S3 - S SSS SS IS 

550 555 

ss ss a ss s is ss s as s as ss as sss is ss 

565 570 
580 585 

ss^aisjssss^ajssssjssgsrisasassj 

595 ° 00 

ssassssssssssijsiG^sssssissssas 



615 



ss ss is ss is as is ss sj as ss ss ss sj ss; ss 

630 635 

as w as sss is? ss ss s ss us ss $ sj as ss ss 

645 650 

ai & ss; ss us ss g ss ss ss ss ss ss sss si sss 



ss as ss ss sss ss SSS SS SS SS SS IS SS IS ss si 

ss ss; ^ as as sss ss as ss as ss ss? is as as si? 



675 

AAT GCA GAG CAC AAG AGG GGG CCT 
.-i, ^i.. His Lys Arg Gly Pro 

695 700 

as ss is ss ss ss iss sss as as sss ss; sss ss as as 



710 



sss sss sss sss ss? as as sit sss sss is ss? is ss; sss ss; 

725 730 /3J 



1591 



1639 



1687 



1735 



1783 



1831 



1879 



1927 



1975 



2023 



2071 



2119 



2167 



2215 



2263 






Gln Gln Gln Ser Gln His Gly Gly Ser His Pro Ala Pro Ser Pro Val 
980 985 990 

CAG CAC CAT CAG CAC CAG GCT GCC CAG GCT CTT CAT CTG GCC AGT CCA 3079 
Gln His His Gln His Gln Ala Ala Gln Ala Leu His Leu Ala Ser Pro 
995 1000 1005 1010 

CAG CAG CAG TCG GCC ATT TAT CAT GCG GGG CTG GCA CCA ACA CCA CCT 3127 
Gln Gln Gln Ser Ala Ile Tyr His Ala Gly Leu Ala Pro Thr Pro Pro 
1015 1020 1025 

TCC ATG ACA CCT GCC TCT AAT ACA CAG TCT CCA CAG AGC AGT TTC CCA 3175 
Ser Met Thr Pro Ala Ser Asn Thr Gln Ser Pro Gln Ser Ser Phe Pro 
1030 1035 1040 

GCA GCA CAA CAG ACA GTC TTC ACC ATC CAC CCT TCT CAT GTT CAG CCG 3223 
Ala Ala Gln Gln Thr Val Phe Thr Ile His Pro Ser His Val Gln Pro 
1045 1050 1055 

GCA TAC ACC ACC CCA CCC CAC ATG GCC CAC GTA CCT CAG GCT CAT GTA 3271 
Ala Tyr Thr Thr Pro Pro His Met Ala His Val Pro Gln Ala His Val 
1060 1065 1070 

CAG TCA GGA ATG GTT CCT TCT CAT CCA ACT GCC CAT GCG CCA ATG ATG 3319 
Gln Ser Gly Met Val Pro Ser His Pro Thr Ala His Ala Pro Met Met 
1075 1080 1085 1090 

CTA ATG ACG ACA CAG CCA CCC GGT CCC AAG GCC GCC CTC GCT CAA AGT 3367 
Leu Met Thr Thr Gln Pro Pro Gly Pro Lys Ala Ala Leu Ala Gln Ser 
1095 1100 1105 

GCA CTA CAG CCC ATT CCA GTT TCG ACA ACA GCG CAT TTC CCT TAT ATG 3415 
Ala Leu Gln Pro Ile Pro Val Ser Thr Thr Ala His Phe Pro Tyr Met 
1110 1115 1120 

ACG CAC CCT TCA GTA CAA GCC CAC CAC CAA CAG CAG TTG TAA 3457 
Thr His Pro Ser Val Gln Ala His His Gln Gln Gln Leu 
1125 1130 1135 

GGCTGCCTTG GAGGAACCGA AAGGCCAAAT CCCTTCTTCC CTTCTCTGCT TCTGCCAACC 3517 

GGAAGCACAG AAAACTAGAA CTTCATTGAT TTTGTTTTTT AAAAGATACA CTGATTTAAC 3577 

ATCTGATAGG AATGCTAACA GCTCACTTGC AGTGGAGGAT GTTTTGGACC GAGTAGAGGC 3637 

ATGTAGGGAC TTGTGGCTGT TCCATAATTC CATGTGCTGT TGCAGGGTCC TGCAAGTACC 3697 

CAGCTCTGCT TGCTGAAACT GGAAGTTATT TATTTTTTAA TGGCCCTTGA GAGTCATGAA 3757 

CACATCAGCT AGCAACAGAA GTAACAAGAG TGATTCTTGC T 3798 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1135 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ser Leu Lys Pro Gln Pro Gln Pro Pro Ala Pro Ala Thr Gly Arg 
1 5 10 15 

Lys Pro Gly Gly Gly Leu Leu Ser Ser Pro Gly Ala Ala Pro Ala Ser 
20 25 30 

Ala Ala Val Thr Ser Ala Ser Val Val Pro Ala Pro Ala Ala Pro Val 
35 40 45 

Ala Ser Ser Ser Ala Ala Ala Gly Gly Gly Arg Pro Gly Leu Gly Arg 
50 55 60 

Gly Arg Asn Ser Ser Lys Gly Leu Pro Gln Pro Thr Ile Ser Phe Asp 
65 70 75 80 

Gly Ile Tyr Ala Asn Val Arg Met Val His Ile Leu Thr Ser Val Val 
85 90 95 

Gly Ser Lys Cys Glu Val Gln Val Lys Asn Gly Gly Ile Tyr Glu Gly 
100 105 110 

Val Phe Lys Thr Tyr Ser Pro Lys Cys Asp Leu Val Leu Asp Ala Ala 
115 120 125 

His Glu Lys Ser Thr Glu Ser Ser Ser Gly Pro Lys Arg Glu Glu Ile 
130 135 140 

Met Glu Ser Val Leu Phe Lys Cys Ser Asp Phe Val Val Val Gln Phe 
145 150 155 160 

Lys Asp Thr Asp Ser Ser Tyr Ala Arg Arg Asp Ala Phe Thr Asp Ser 
165 170 175 

Ala Leu Ser Ala Lys Val Asn Gly Glu His Lys Glu Lys Asp Leu Glu 
180 185 190 

Pro Trp Asp Ala Gly Glu Leu Thr Ala Ser Glu Glu Leu Glu Leu Glu 
195 200 205 

Asn Asp Val Ser Asn Gly Trp Asp Pro Asn Asp Met Phe Arg Tyr Asn 
210 215 220 

Glu Glu Asn Tyr Gly Val Val Ser Thr Tyr Asp Ser Ser Leu Ser Ser 
225 230 235 240 

Tyr Thr Val Pro Leu Glu Arg Asp Asn Ser Glu Glu Phe Leu Lys Arg 
245 250 255 

Glu Ala Arg Ala Asn Gln Leu Ala Glu Glu Ile Glu Ser Ser Ala Gln 
260 265 270 

Tyr Lys Ala Arg Val Ala Leu Glu Asn Asp Asp Arg Ser Glu Glu Glu 
275 280 285 

Lys Tyr Thr Ala Val Gln Arg Asn Cys Ser Asp Arg Glu Gly His Gly 
290 295 300 

Pro Asn Thr Arg Asp Asn Lys Tyr Ile Pro Pro Gly Gln Arg Asn Arg 
305 310 315 320 

Glu Val Leu Ser Trp Gly Ser Gly Arg Gln Ser Ser Pro Arg Met Gly 
325 330 335 



Gin Pro Gly Pro Gly Ser Met Pro ser Arg Ala Ala Ser His Thr ser 
340 345 350 

Asp Phe Asn Pro Asn Ala Gly Ser Asp Gin Arg val Val Asn Gly Gly 
355 360 365 

Val Pro Trp Pro Ser Pro Cys Pro Ser His Ser Ser Arg Pro Pro Ser 
370 375 380 

Arg Tyr Gin Ser Gly Pro Asn Ser Leu Pro Pro Arg Ala Ala Thr His 
385 390 395 400 

Thr Arg Pro Pro Ser Arg Pro Pro Ser Arg Pro Ser Arg Pro Pro Ser 
405 410 415 

His Pro Ser Ala His Gly Ser Pro Ala Pro Val Ser Thr Met Pro Lys 
420 425 430 

Arg Met Ser ser Glu Gly Pro Pro Arg Met Ser Pro Lys Ala Gin Arg 
435 440 445 

His Pro Arg Asn His Arg Val Ser Ala Gly Arg Gly Ser Met Ser Ser 
450 455 460 



Gly Leu Glu Phe Val Ser His Asn Pro Pro Ser Glu Ala Ala Ala Pro 
465 470 475 480 

Pro val Ala Arg Thr Ser Pro Ala Gly Gly Thr Trp Ser ser Val val 
485 490 495 

Ser Gly Val Pro Arg Leu Ser Pro Lys Thr His Arg Pro Arg Ser Pro 
500 505 510 

Arg Gin Ser Ser lie Gly Asn Ser Pro Ser Gly Pro Val Leu Ala Ser 
515 520 525 

Pro Gin Ala Gly lie lie Pro Ala Glu Ala val Ser Met Pro val Pro 
530 535 540 

Ala Ala Ser Pro Thr Pro Ala Ser Pro Ala Ser Asn Arg Ala Leu Thr 
545 550 555 560 

Pro Ser Ile Glu Ala Lys Asp Ser Arg Leu Gln Asp Gln Arg Gln Asn 
565 570 575 

Ser Pro Ala Gly Ser Lys Glu Asn Val Lys Ala Ser Glu Thr Ser Pro 
580 585 590 

Ser Phe Ser Lys Ala Asp Asn Lys Gly Met Ser Pro Val Val Ser Glu 
595 600 605 

His Arg Lys Gln Ile Asp Asp Leu Lys Lys Phe Lys Asn Asp Phe Arg 
610 615 620 

Leu Gln Pro Ser Ser Thr Ser Glu Ser Met Asp Gln Leu Leu Ser Lys 
625 630 635 640 

Asn Arg Glu Gly Glu Lys Ser Arg Asp Leu Ile Lys Asp Lys Thr Glu 
645 650 655 

Ala Ser Ala Lys Asp Ser Phe Ile Asp Ser Ser Ser Ser Ser Ser Asn 
660 665 670 

Cys Thr Ser Gly Ser Ser Lys Thr Asn Ser Pro Ser Ile Ser Pro Ser 
675 680 685 

Met Leu Ser Asn Ala Glu His Lys Arg Gly Pro Glu Val Thr Ser Gln 
690 695 700 

Gly Val Gln Thr Ser Ser Pro Ala Cys Lys Gln Glu Lys Asp Asp Arg 
705 710 715 720 

Glu Glu Lys Lys Asp Thr Thr Glu Gln Val Arg Lys Ser Thr Leu Asn 
725 730 735 

Pro Asn Ala Lys Glu Phe Asn Pro Arg Ser Phe Ser Gln Pro Lys Pro 
740 745 750 

Ser Thr Thr Pro Thr Ser Pro Arg Pro Gln Ala Gln Pro Ser Pro Ser 
755 760 765 

Met Val Gly His Gln Gln Pro Ala Pro Val Tyr Thr Gln Pro Val Cys 
770 775 780 

Phe Ala Pro Asn Met Met Tyr Pro Val Pro Val Ser Pro Gly Val Gln 
785 790 795 800 

Pro Leu Tyr Pro Ile Pro Met Thr Pro Met Pro Val Asn Gln Ala Lys 
805 810 815 

Thr Tyr Arg Ala Gly Lys Val Pro Asn Met Pro Gln Gln Arg Gln Asp 
820 825 830 

Gln His His Gln Ser Thr Met Met His Pro Ala Ser Ala Ala Gly Pro 
835 840 845 

Pro Ile Val Ala Thr Pro Pro Ala Tyr Ser Thr Gln Tyr Val Ala Tyr 
850 855 860 

Ser Pro Gln Gln Phe Pro Asn Gln Pro Leu Val Gln His Val Pro His 
865 870 875 880 

Tyr Gln Ser Gln His Pro His Val Tyr Ser Pro Val Ile Gln Gly Asn 
885 890 895 

Ala Arg Met Met Ala Pro Pro Ala His Ala Gln Pro Gly Leu Val Ser 
900 905 910 

Ser Ser Ala Ala Gln Phe Gly Ala His Glu Gln Thr His Ala Met Tyr 
915 920 925 

Ala Cys Pro Lys Leu Pro Tyr Asn Lys Glu Thr Ser Pro Ser Phe Tyr 
930 935 940 

Phe Ala Ile Ser Thr Gly Ser Leu Ala Gln Gln Tyr Ala His Pro Asn 
945 950 955 960 

Ala Ala Leu His Pro His Thr Pro His Pro Gln Pro Ser Ala Thr Pro 
965 970 975 

Thr Gly Gln Gln Gln Ser Gln His Gly Gly Ser His Pro Ala Pro Ser 
980 985 990 

Pro Val Gln His His Gln His Gln Ala Ala Gln Ala Leu His Leu Ala 
995 1000 1005 

Ser Pro Gln Gln Gln Ser Ala Ile Tyr His Ala Gly Leu Ala Pro Thr 
1010 1015 1020 

Pro Pro Ser Met Thr Pro Ala Ser Asn Thr Gln Ser Pro Gln Ser Ser 
1025 1030 1035 1040 

Phe Pro Ala Ala Gln Gln Thr Val Phe Thr Ile His Pro Ser His Val 
1045 1050 1055 

Gln Pro Ala Tyr Thr Thr Pro Pro His Met Ala His Val Pro Gln Ala 
1060 1065 1070 

His Val Gln Ser Gly Met Val Pro Ser His Pro Thr Ala His Ala Pro 
1075 1080 1085 

Met Met Leu Met Thr Thr Gln Pro Pro Gly Pro Lys Ala Ala Leu Ala 
1090 1095 1100 

Gln Ser Ala Leu Gln Pro Ile Pro Val Ser Thr Thr Ala His Phe Pro 
1105 1110 1115 1120 

Tyr Met Thr His Pro Ser Val Gln Ala His His Gln Gln Gln Leu 
1125 1130 1135 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGGCCCCTCA CCATGTCG 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CGGGCTTGCG GACATTGG 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CCGCGGCTGC CAATGTCC 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GTAACCGTTC GGCGCCCG 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGCTCCCGGC GGCTCCTT 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TGCTGCTGCT GCTGGGGCTT CAG 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCGCCCGCTC CTCACGTGT 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ACCCCCGAGA AAGCAACC 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CCGTTGCCGT TGCTACCA 

(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TTCTCATGTG CGGCATCAAG 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Val Tyr Gly Pro Leu Thr Met Ser Leu Lys Pro Gln Gln Gln Gln Gln 
1 5 10 15 

Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln 
20 25 30 

Gln Pro Pro Pro Ala Ala Ala Asn Val Arg Lys Pro Gly Gly Ser Gly 
35 40 45 

Leu Leu Ala Ser Pro Ala Ala Ala Pro Ser Pro Ser Ser Ser Ser Val 
50 55 60 

Ser Ser Ser Ser Ala Thr Ala Pro Ser Ser Val Val Ala Ala Thr Ser 
65 70 75 80 

Gly Gly Gly Arg Pro Gly Leu Gly Arg Gly Arg Asn Ser Asn Lys Gly 
85 90 95 

Leu Pro Gln Ser Thr Ile Ser Phe Asp Gly Ile Tyr Ala Asn Met Arg 
100 105 110 

Met Val His Ile Leu Thr Ser Val Val Gly Ser Lys Cys Glu Val Gln 
115 120 125 

Val Lys Asn Gly Gly Ile Tyr Glu Gly Val Phe Lys Thr Tyr Ser Pro 
130 135 140 

Lys Cys Asp Leu Val Leu Asp Ala Ala His Glu Lys Ser Thr Glu Ser 
145 150 155 160 

Ser Ser Gly Pro Lys Arg Glu Glu Ile Met Glu Ser Ile Leu Phe Lys 
165 170 175 

Cys Ser Asp Phe Val Val Val Gln Phe Lys Asp Met Asp Ser Ser Tyr 
180 185 190 

Ala Lys Arg Asp Ala Phe Thr Asp Ser Ala Ile Ser Ala Lys Val Asn 
195 200 205 



Gly Glu His Lys Glu Lys Asp Leu Glu Pro Trp Asp Ala Gly Glu Leu 
210 215 220 

Thr Ala Asn Glu Glu Leu Glu Ala Leu Glu Asn Asp Val Ser Asn Gly 
225 230 235 240 

Trp Asp Pro Asn Asp Met Phe Arg Tyr Asn Glu Glu Asn Tyr Gly Val 
245 250 255 

Val Ser Thr Tyr Asp Ser Ser Leu Ser Ser Tyr Thr Val Pro Leu Glu 
260 265 270 

Arg Asp Asn Ser Glu Glu Phe Leu Lys Arg Glu Ala Arg Ala Asn Gln 
275 280 285 

Leu Ala Glu Glu Ile Glu Ser Ser Ala Gln Tyr Lys Ala Arg Val Ala 
290 295 300 

Leu Glu Asn Asp Asp Arg Ser Glu Glu Glu Lys Tyr Thr Ala Val Gln 
305 310 315 320 

Arg Asn Ser Ser Glu Arg Glu Gly His Ser Ile Asn Thr Arg Glu Asn 
325 330 335 

Lys Tyr Ile Pro Pro Gly Gln Arg Asn Arg 
340 345 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 326 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

His Glu Gly Pro Leu Thr Met Ser Leu Lys Pro Gln Pro Gln Pro Pro 
1 5 10 15 

Ala Pro Ala Thr Gly Arg Lys Pro Gly Gly Gly Leu Leu Ser Ser Pro 
20 25 30 

Gly Ala Ala Pro Ala Ser Ala Ala Val Thr Ser Ala Ser Val Val Pro 
35 40 45 

Ala Pro Ala Ala Pro Val Ala Ser Ser Ser Ala Ala Ala Gly Gly Gly 
50 55 60 

Arg Pro Gly Leu Gly Arg Gly Arg Asn Ser Ser Lys Gly Leu Pro Gln 
65 70 75 80 

Pro Thr Ile Ser Phe Asp Gly Ile Tyr Ala Asn Val Arg Met Val His 
85 90 95 

Ile Leu Thr Ser Val Val Gly Ser Lys Cys Glu Val Gln Val Lys Asn 
100 105 110 

Gly Gly Ile Tyr Glu Gly Val Phe Lys Thr Tyr Ser Pro Lys Cys Asp 
115 120 125 

Leu Val Leu Asp Ala Ala His Glu Lys Ser Thr Glu Ser Ser Ser Gly 
130 135 140 

Pro Lys Arg Glu Glu Ile Met Glu Ser Val Leu Phe Lys Cys Ser Asp 
145 150 155 160 

Phe Val Val Val Gln Phe Lys Asp Thr Asp Ser Ser Tyr Ala Arg Arg 
165 170 175 

Asp Ala Phe Thr Asp Ser Ala Leu Ser Ala Lys Val Asn Gly Glu His 
180 185 190 



Lys Glu Lys Asp Leu Glu Pro Trp Asp Ala Gly Glu Leu Thr Ala Ser 
195 200 205 

Glu Glu Leu Glu Leu Glu Asn Asp Val Ser Asn Gly Trp Asp Pro Asn 
210 215 220 

Asp Met Phe Arg Tyr Asn Glu Glu Asn Tyr Gly Val Val Ser Thr Tyr 
225 230 235 240 

Asp Ser Ser Leu Ser Ser Tyr Thr Val Pro Leu Glu Arg Asp Asn Ser 
245 250 255 

Glu Glu Phe Leu Lys Arg Glu Ala Arg Ala Asn Gln Leu Ala Glu Glu 
260 265 270 

Ile Glu Ser Ser Ala Gln Tyr Lys Ala Arg Val Ala Leu Glu Asn Asp 
275 280 285 

Asp Arg Ser Glu Glu Glu Lys Tyr Thr Ala Val Gln Arg Asn Cys Ser 
290 295 300 

Asp Arg Glu Gly His Gly Pro Asn Thr Arg Asp Asn Lys Tyr Ile Pro 
305 310 315 320 

Pro Gly Gln Arg Asn Arg 
325 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Leu Ala Pro Gln Pro Pro Pro Pro Gln Gln His Gln Glu Arg Pro Gly 
1 5 10 15 

Ala Ala Ala Ile Gly Ser Ala Arg Gly Gln Ser Thr Gly Lys Gly Pro 
20 25 30 

Pro Gln Ser Pro Val Phe Glu Gly Val Tyr Asn Asn Ser Arg Met Leu 
35 40 45 

His Phe Leu Thr Ala Val Val Gly Ser Thr Cys Asp Val Lys Val Lys 
50 55 60 

Asn Gly Thr Thr Tyr Glu Gly Ile Phe Lys Thr Leu Ser Ser Lys Phe 
65 70 75 80 

Glu Leu Ala Val Asp Ala Val His Arg Lys Ala Ser Glu Pro Ala Gly 
85 90 95 

Gly Pro Arg Arg Glu Asp Ile Val Asp Thr Met Val Phe Lys Pro Ser 
100 105 110 

Asp Val Met Leu Val His Phe Arg Asn Val Asp Phe Asn Tyr Ala Thr 
115 120 125 

Lys Asp Lys Phe Thr Asp Ser Ala Ile Ala Met Asn Ser Lys Val Asn 
130 135 140 

Gly Glu His Lys Glu Lys Val Leu Gln Arg Trp Glu Gly Gly Asp Ser 
145 150 155 160 

Asn Ser Asp Asp Tyr Asp Leu Glu Ser Asp Met Ser Asn Gly Trp Asp 
165 170 175 



Pro Asn Glu Met Phe Lys Phe Asn Glu Glu Asn Tyr Gly Val Lys Thr 
180 185 190 

Thr Tyr Asp Ser Ser Leu Ser Ser Tyr Thr Val Pro Leu Glu Lys Asp 
195 200 205 

Asn Ser Glu Glu Phe Arg Gln Arg Glu Leu Arg Ala Ala Gln Leu Ala 
210 215 220 

Arg Glu Ile Glu Ser Ser Pro Gln Tyr Arg Leu Arg Ile Ala Met Glu 
225 230 235 240 

Asn Asp Asp Gly Arg Thr Glu Glu Glu Lys His Ser Ala Val Gln Arg 
245 250 255 

Gln Gly Ser Gly Arg Glu Ser Pro Ser Leu Ala Ser Arg Glu Gly Lys 
260 265 270 

Tyr Ile Pro 
275 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

AAT GAC ATG TTT CGA TAT AAT GAA GAG AAT TAT GGT GTG GTG TCC ACA    718 

Asn Asp Met Phe Arg Tyr Asn Glu Glu Asn Tyr Gly Val Val Ser Thr 
225 230 235 

tax GAT TCC ACT TTA TCT TCA TAT ACG GTT CCT TTA GAA AGG GAC AAC    766 

Tyr Asp Ser Ser Leu Ser Ser Tyr Thr Val Pro Leu Glu Arg Asp Asn 
240 245 250 255 

TCA GAA GAA TTT CTT AAA CGG GAG GCA AGG GCA AAC CAG TTA GCA GAA    814 

Ser Glu Glu Phe Leu Lys Arg Glu Ala Arg Ala Asn Gln Leu Ala Glu 
260 265 270 

GAA ATT GAA TCC AGT GCT CAG TAC AAA GCT CGT GTC GCC CTT GAG AAT    862 

Glu Ile Glu Ser Ser Ala Gln Tyr Lys Ala Arg Val Ala Leu Glu Asn 
275 280 285 

GAT GAC CGG AGT GAG GAA GAA AAA TAC ACA GCA GTC CAG AGA AAC TGC    910 

Asp Asp Arg Ser Glu Glu Glu Lys Tyr Thr Ala Val Gln Arg Asn Cys 
290 295 300 

act GAC CGG GAG GGG CAT GGC CCC AAC ACT AGG GAC AAT AAA TAT ATT    958 

Ser Asp Arg Glu Gly His Gly Pro Asn Thr Arg Asp Asn Lys Tyr Ile 
305 310 315 

rn CCT GGA CAA AGA AAC AGA GAA GTC CTA TCC TGG GGA AGT GGG AGA    1006 

Pro Pro Gly Gln Arg Asn Arg Glu Val Leu Ser Trp Gly Ser Gly Arg 
320 325 330 335 

TAC AGC TCA CCA CGG ATG GGC CAG CCT GGG CCA GGC TCC ATG CCG TCA    1054 

Gln Ser Ser Pro Arg Met Gly Gln Pro Gly Pro Gly Ser Met Pro Ser 
340 345 350 

AGA GCT GCT TCT CAC ACT TCA GAT TTC AAC CCG AAC GCT GGC TCA GAC    1102 

Arg Ala Ala Ser His Thr Ser Asp Phe Asn Pro Asn Ala Gly Ser Asp 
355 360 365 

CAA AGA GTA GTT AAT GGA GGT GTT CCC TGG CCA TCG CCT TGC CCA TCT    1150 

Gln Arg Val Val Asn Gly Gly Val Pro Trp Pro Ser Pro Cys Pro Ser 
370 375 380 

rr TCT CGC CCA CCT TCT CGC TAC CAG TCA GGT CCC AAC TCT CTT    1198 

Pro Ser Ser Arg Pro Pro Ser Arg Tyr Gln Ser Gly Pro Asn Ser Leu 
385 390 395 

rr* rr-r rrc ccb CCC ACC CCT ACA CGG CCT CGT GCC GAA TTC CTG CAG    1246 

irl Art Ala Ala Thr Pro Thr Arg Pro Arg Ala Glu Phe Leu Gln 
400 405 410 415 

CCC GGG GAT CC    1257 

Pro Gly Asp 